Node Classification in Graph Data using Augmented Random Walk
نویسندگان
چکیده
Node classification in graph data plays an important role in web mining applications. We classify the existing node classifiers into Inductive and Transductive approaches. Among the Transductive methods, the Majority Rule method (MRM) has a prominent role. This method considers only the class labels of the neighboring nodes, neglecting the informative connectivity information in the graph data. In this paper, we propose an Augmented Random Walk (ARW) based approach to resolve main limitations of MRM. In our proposed method, first, we augment the initial graph by adding class labels as new nodes to the graph and then we connect each classified node to its corresponding class label nodes. Second, we apply a Random Walk algorithm to find the similarity score of each un-classified node to different class labels. Third, we predict class labels with the highest scores for the un-classified node. Empirical results show that our proposed method clearly outperforms the Majority Rule method in six graph datasets with high homophily.
منابع مشابه
Evaluating the Quality of a Network Topology through Random Walks
In this brief announcement we propose a distributed algorithm to assess the connectivity quality of a network, be it physical or logical. In large complex networks, some nodes may play a vital role due to their position (e.g. for routing or network reliability). Assessing global properties of a graph, as importance of nodes, usually involves lots of communications; doing so while keeping the ov...
متن کاملEfficient algorithms for topology control and information dissemination/ retrieval in large scale Wireless Sensor Networks
Wireless Sensor Networks (WSNs) require radically new approaches for protocol/ algorithm design, with a focus towards energy efficiency at the node level. We propose two algorithms for energy-efficient, distributed clustering called Directed Budget Based (DBB) and Directed Budget Based with Random Delays (DBB-RD). Both algorithms improve clustering performance and overall network decomposition ...
متن کاملOn the Embeddability of Random Walk Distances
Analysis of large graphs is critical to the ongoing growth of search engines and social networks. One class of queries centers around node affinity, often quantified by random-walk distances between node pairs, including hitting time, commute time, and personalized PageRank (PPR). Despite the potential of these “metrics,” they are rarely, if ever, used in practice, largely due to extremely high...
متن کاملLPKP: location-based probabilistic key pre-distribution scheme for large-scale wireless sensor networks using graph coloring
Communication security of wireless sensor networks is achieved using cryptographic keys assigned to the nodes. Due to resource constraints in such networks, random key pre-distribution schemes are of high interest. Although in most of these schemes no location information is considered, there are scenarios that location information can be obtained by nodes after their deployment. In this paper,...
متن کاملEnhanced Random Walk with Choice: An Empirical Study
The random walk with choice is a well known variation to the random walk that first selects a subset of d neighbours nodes and then decides to move to the node which maximizes the value of a certain metric; this metric captures the number of (past) visits of the walk to the node. In this paper we propose an enhancement to the random walk with choice by considering a new metric that captures not...
متن کامل